EXPLICIT GROWTH AND EXPANSION FOR SL 2 



EMMANUEL KOWALSKI 



Abstract. We give explicit versions of Helfgott's Growth Theorem for SL2, as well as 
of the Bourgain-Gamburd argument for expansion of Cayley graphs modulo primes of 
subgroups of SL2(Z) which are Zariski-dense in SL2. 
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1. Introduction 

Our main goal in this paper is to prove the following result, which is an explicit version 
of a theorem of Bourgain and Gamburd [1] : 

Theorem 1.1. Let S C SL 2 (Z) be a finite symmetric set such that the subgroup generated 
by S is Zariski-dense in SL 2 (Z). Let 7 be the set of primes such that S p = S (modp) 
generates SL 2 (F P ), which contains all but finitely many primes. Then the family of Cayley 
graphs (C(SL 2 (F p ), S p )) pe y is an expander family, and one can write down explicit bounds 
for the spectral gap, given the set S. 

In particular, if S generates a free group of rank \S\/2, the spectral ga$\ satisfies 



(1.1) Ai(e(SL 2 (F p ),S p )) ^2 

for all p large enough, where 

log(^v^) 



2 35 7" 



7 



logmax se5 ||s|| 

the norm \\s\\ being the operator norm of the matrix s, with respect to the euclidean metric 
onC 2 . 



We can specify what u p large enough" means, but we defer a statement to Section 14.31 
since this involves a series of inequalities which are awkward to state (and unenlightening) , 
but easy to check for a given concrete set of matrices S. 



2010 Mathematics Subject Classification. 20F69, 05C50, 05C81. 

Key words and phrases. Growth of finite groups, expander graphs, Cayley graphs, diameter, random 
walks on groups. 

1 This is the spectral gap of the normalized Laplace operator A = Id — M, where M is the Markov 
averaging operator of the graph; thus the spectrum of A is a subset of the interval [0, 2]. 
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A crucial ingredient in the argument of Bourgain and Gamburd is Helfgott's Growth 
Theorem [TT] for SL 2 , which has considerable independent interest. We thus require an 
explicit version of it, and we will prove the following: 

Theorem 1.2. Let p be a prime number, H C SL 2 (F p ) a symmetric generating subset 
of SL 2 (F p ) containing 1. Then the triple product set = H ■ H ■ H satisfies either 
H { V = SL 2 (F P ) or 

\H^\>\H\ l+ \ 

where 5 = 1/3024. 

Here is a simple corollary, which is (as far as the author is aware) also the first explicit 
result of this kind for almost simple linear groups: 

Corollary 1.3 (Explicit solution to Babai's conjecture for SL 2 (F P )). For any prime 
number p and any symmetric generating set S o/SL 2 (F p ) ; we have 

diame(SL 2 (F p ),5) ^ 3 (log | SL 2 (F P )|) C 

with C = 3323. 

Another corollary of Helfgott's Theorem and of intermediate results used in the proof 
of Theorem 11.11 is a better diameter bound for Zariski-dense subgroups: 

Corollary 1.4 (Diameter bounds for Zariski-dense subgroups of SL 2 ). Let S C SL 2 (Z) 

be a finite symmetric set such that the subgroup generated by S is Zariski-dense in SL 2 (Z) 
and is a free group of rank \S\/2. Let 7 be the set of primes such that S p = S (modp) 
generates SL 2 (F P ). 

Let 5 > be as in Helfgott's Theorem and define 

t -1 = logmax llsll > 0. 

s&S 

Then for p G CP and p > exp(2/r) ; we have 

diam(e(SL 2 (F p ),5)) ^ 3 A (\og | SL 2 (F P )|) 

where 

_ log(8r-\\S\ - I)' 1 ) 
log(l + 5) 

Remark 1.5. Using the well-known bound 
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Ai(r p ) ^ 



\S\ diam(r p ) 2 

(see, e.g., [El Th. 13.23]), these diameter bounds can be used to get lower bounds for 
spectral gaps for "medium" primes. Note the huge discrepancy however at the end of the 
range. 



Combining Theorem 1 1 . 1 1 wit h the second corollary, we can give explicit statements for 
the motivating example of the Lubotzky group. 

Corollary 1.6 (The Lubotzky group). Let 



o i y \±3 i 

and let T p = C(SL 2 (F P ), S p ). Then we have 

(1.2) Ai(r p ) ^ 2- 23 



if p ^ 2 246 , and 

diam(r p ) < 2 5572 (log | SL 2 (F F 
for all p 7^ 3. 

The original papers of Bourgain and Gamburd pQ and Helfgott [H] are effective, and 
thus it is not surprising that one can obtain explicit versions. What is less clear is how 
good the constants may be, and how much work may be required to provide them. This 
paper gives a first indication in that respect. 

The bounds we derive are very unlikely to be anywhere near sharp, and not only be- 
cause we often use rather rough estimates to simplify the shape and constants appearing 
in various inequalities 3 Indeed, when the Hausdorff dimension of the limit set of the 
subgroup G generated by S is large enough, Gamburd [8] has shown quite good spectral 
gaps for the hyperbolic Laplace operator on G\H, which strongly suggest that the corre- 
sponding combinatorial spectral gap would be also relatively large. But this computation 
has not been done, to the author's knowledge, and our version of Theorem 11.11 gives the 
first fully explicit spectral gap for infinite-index subgroups of SL 2 (Z), with Corollary 11.61 
being a nice concrete example (it is also known that the "Lubotzky group" is too small 
for Gamburd's result to apply). 

In view of the direct link between the spectral gap of families of Cayley graphs of quo- 
tients of "thin" (or sparse) subgroups of arithmetic groups and quantitative applications 
of sieve methods to these groups, it is natural to wish for a better understanding of these 
issues^ A first step towards effective versions of these applications of "sieve in orbit" 
would be to extend Theorem 11.11 to an effective spectral gap for SL 2 (Z/gZ), where q is 
a squarefree modulus (as originally proved by Bourgain, Gamburd and Sarnak [2]), and 
we hope to come back to this. 

As a final remark, the reader can also see this paper as presenting a complete proof of 
the qualitative forms of Theorems 11.11 and 11.21 and their corollaries. When read in this 
light, ignoring the fussy technical details arising from trying to have explicit bounds, it 
may in fact be useful as a self-contained introduction to this area of research. 

Notation. As usual, \X\ denotes the cardinality of a set. Given a group G, and a 
symmetric generating set S, we denote by Q(G, S) the Cayley graph of G with respect to 
S, which is jS^-regular. Moreover, we say that a symmetric set S C G freely generates G 
if representatives of S modulo the relation s ~ s _1 form a free generating set of G, i.e., 
G is a free group of rank 15*1/2. 

For a subset H C G of a group G, we write for the n-fold product set 

H (n) = {x G G | x = hi---h n ioi some hi G H}. 
Note the immediate relations 

(#(")) = }j( nm ) ^ ff( n + m ) — . 

for n, m ^ and (H^)~ 1 = if H is symmetric. In addition, if 1 G H, we have 
H(n) c H (m) for all 

m ^ n. In particular, the diameter of a Cayley graph G(G, H), when 
H = H- 1 , is the smallest n ^ 1 such that H {n) = G, where H = H U {1}. 
We denote by trp(if) the "tripling constant" of a subset H C G, defined by 



V) 



trp(if) 



\H\ 



2 In some cases, one can easily extract better bounds from the proof, e.g., one can replace 1/3024 by 
1/1513 for all H large enough in Theorem 1 1.2 1 

3 Indeed, this question was asked by J- P. Serre during the author's Bourbaki lecture |14j . 
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2. Explicit multiplicative combinatorics 



Another ingredient of Theorem 11.11 is the relation between subsets of a finite group 
with small "multiplicative energy" and sets with small tripling constant, or approximate 
subgroups. This was obtained by Tao p4j , in good qualitative form, but without explicit 
dependency of the various quantities involved. In this section, we state a suitably explicit 
version. 

We recall first the definitions involved. For a finite group G and A, B G G, one defines 
the multiplicative energy by 

E(A, B) = \{(a h a 2 , h, b 2 ) e A 2 x B 2 | ai&i = a 2 b 2 }\. 

It is also convenient to denote by 

e(A B) - lE{AB)l 

e[AB) ~ (\A\\B\y/*- 

the normalized multiplicative energy, which is ^ 1. Following Tao (see [2U Def. 3.8]), for 
a finite group G and any a ^ 1, a subset H C G is an a-approximate subgroup if 1 G H, 
H = H _1 and there exists a symmetric subset X C of order at most a such that 

(2.1) H-HcX-H, 
which implies also H ■ H C H • X. Then we have: 

Theorem 2.1. Let G be a finite group and a ^ 1. If A and B are subsets of G such 
that e(A,B) ^ a -1 , there exist constants fa, fa, fa, fa^-1, a fa-approximate subgroup 
H C G and elements x, y G G such that 

|H| ^ f3 2 \A\ <: f3 2 a 2 \B\, 
\AHxH\ ^ ^-\A\, \BH Hy| ^ -^r\B\, 

P3 P3 

trp(H) *C 4 , 

and moreover Pi ^ c±a C2 for some absolute constants c\, c 2 > 0. In fact, one can take 

(2.2) ft ^ 2 1861 a 720 , fa ^ 2 32 V 26 , fa ^ 2 2424 a 937 , fa ^ 2 93 V 60 . 

Except for the values of the constants, this is proved in [2H Th. 5.4, (i) implies (iv)] 
and quoted in [261 Th. 2.48]. Since this is obtained by following line by line the arguments 
of Tao, we defer a proof to the Appendix. 



Lecture notes for this course are available, and contain more background and motivating material [15] . 
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3. Growth for SL 2 



We prove here Theorem 11.21 The argument we use is basically the one sketched by 
Pyber and Szabo in [211 §1-1] (which is expanded in their paper to cover much more 
general situations). It is closely related to the one of Breuillard, Green and Tao [I], and 
many ingredients are already visible in Helfgott's original argument |llj . 

3.1. Elementary facts and definitions. We begin with an important observation, 
which applies to all finite groups, and goes back to Ruzsa: to prove that the tripling 
constant of a generating set H is at least a small power of \H\, it is enough to prove that 
the growth ratio after an arbitrary (but fixed) number of products is of such order of 
magnitude. 

Proposition 3.1 (Ruzsa). Let G be a finite group, and let H C G be a symmetric 
non-empty subset. 

(1) Denoting a n = \H^\/\H\, we have 

(3.1) a n «C a™" 2 = trp{H) n - 2 

for all n ^ 3. 

(2) We have tvp(H^) ^ trp(i/) 4 and for k ^ 3, we have 

trp(H {k) ) <: trp(#) 3fc - 3 . 
Proof. The first part is well-known (see, e.g., [TT| Proof of Lemma 2.2]). For (2), we have 

trp (tf«) = 2**. 

Since a k ^ a 3 for k ^ 3, we obtain trp(i7 (fc) ) < af"' 3 for k ^ 3 by (1), while for k ^ 2, 
we simply use ct-i ^ 1 to get trp(if( 2 )) ^ ol\. □ 

We first use Ruzsa's Lemma to show that Helfgott's Theorem holds when \H\ is small, 
in the following sense: 

Lemma 3.2. Let G be a finite group and let H be a symmetric generating set of G 
containing 1. If + G, we have \H® \ ^ 2 1 ' 2 \H\. 

Proof. If the triple product set is not all of G, it follows that ^ H^ 2 \ We fix some 
x £ — H^ 2 \ and consider the injective map 

i ■ i H ~^ G 
1 h i — Y hx 

The image of this map is contained in H ^ and it is disjoint with H since x 
Hence which contains H and the image of z, satisfies \H^\ ^ 2\H\. Hence, by 

Ruzsa's Lemma, we obtain 

□ 

Remark 3.3. In fact, as the referee pointed out, a better result is known (and is elemen- 
tary): if H C G generates G then 2^ ^ \\H\ (see [25]). 

The following version of the orbit-stabilizer theorem will be used to reduce the proof 
of lower-bounds on the size a set to an upper-bound for another. 
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Proposition 3.4 (Helfgott). Let G be a finite group acting on a non-empty finite set X . 
Fix some x G X and let K C G be the stabilizer of x in G. For any non-empty symmetric 
subset H C G, we have 

\Kr\H^\> lH 



\H-x\ 



: 



where H ■ x = {h ■ x \ h G H}. 

(Note that since H is symmetric, we have 1 G K fl ii( 2 ).) 

Proof. As in the classical proof of the orbit-stabilizer theorem, we consider the orbit map, 
but restricted to H 

' H — > X 
h i — y h ■ x 

Using the fibers of this map to count the number of elements in H, we get 

i/e*(/r) 

But the image of (f> is 0(ii) = H ■ x, and we have 

l^- 1 ^)! < \Kr\H {2) \ 

for all y (indeed, if y = <f>(ho) with h G ii, then all elements h E H with 0(/i) = y satisfy 
h^he Kn H^). Therefore we get 

|#| ^ |#-x||if n# (2) |, 

as claimed. □ 

Finally, a last lemma shows that if a subset H has small tripling constant "in a sub- 
group", then H itself has small tripling (in the language of approximate groups, it is a 
special case of the fact that the intersection of two approximate groups is still one). 

Lemma 3.5. Let G be a finite group, K C G a subgroup, and H C G an arbitrary 
symmetric subset. For any n ^ 1, we have 

|#(n+l)| \H^DK\ 



\H\ \H^r\K\' 

Proof. Let X <Z G/ K he the set of cosets of K intersecting if: 

X = {xK G G/K | xK n H ^ 0}. 

We can estimate the size of this set from below by splitting H into its intersections 
with cosets of K: we have 

\H\ = E \HHxK\. 

xKex 

But for any xK G X, fixing some g G xKdH, we have g^go G KC\H^> if g G xKDH, 
and hence 

|xKni/| ^ |^n# (2) |. 

This gives the lower bound 



|irn/J( 2 )|' 

Now take once more some xK G X, and fix an element xk = h G rciT fl ii. Then all 
the elements xkg are distinct for g G if, and they are in xif fl if*™ 4 " 1 ) if g G K fl if( n \ so 
that 

\xK n# (n+1) | > IK n# (n) | 



for any xK G X, and (cosets being disjoint) 

^ \X\\KnH^\, 

which gives the result when combined with the lower bound for \X\. □ 

We will use classical structural definitions and facts about finite groups of Lie type. In 
particular, a regular semisimple element j6G = SL^Fp) is a semisimple element with 
distinct eigenvalues. The centralizer of such an element is a maximal torus in G. For 
any subset if C G, we write H reg for the set of the regular semisimple elements in H. A 
maximal torus T C G = SL^Fp) is the intersection GnT, where T is a maximal torus 
of G which is stable under the Frobenius automorphism a. Here are the basic properties 
of regular semisimple elements and their centralizers; these are all standard facts, and we 
omit the proofs. (For general facts about finite groups of Lie type, one may look at [S] 
or [3 Ch. 1 and 3], and for conjugacy classes in SL 2 (F P ), one may look for instance at [3, 
p. 70]; another source for SL 2 is [221 Ch. 6]). 

Proposition 3.6. Fix a prime number p and let G = SL2(F p ) ; G = SL2(F P ). 

(1) A regular semisimple element x G G is contained in a unique maximal torus T, 
namely its centralizer T = Cg(x). In particular, if Ti ^ T 2 are two maximal tori, we 
have 

(3.2) 

- 1 - l,reg ' 1 - 1 - 2,reg V- 

(2) //TcG is a maximal torus, we have 

IT I = IT — T 1 = 2 

\ s -nreg\ | - 1 - - L reg\ ^- 

(3) For any maximal torus T, the normalizer Ng(T) contains T as a subgroup of index 
2. Similarly, for any maximal torus T C G, Ng(T) contains T as a subgroup of index 2, 
and in particular 

2(p-l) ^ \N G (T)\^2(p + l). 

(4) The conjugacy class Cl(g) of a regular semisimple element g G G is the set of 
all x G G such that Tr(x) = Tr(g). The set of elements in G which are not regular 
semisimple is the set of all x G G such that Tr(x) 2 = 4. 

Finally, (a variant of) the following concept was introduced under different names and 
guises by Helfgott, Pyber-Szabo, and Breuillard-Green-Tao. We chose the name from the 
last team. 

Definition 3.7 (A set involved with a torus). Let p be a prime number, H C SL 2 (F P ) 
a finite set and T C SL2(F P ) a maximal torus. Then H is involved with T, or T with 
H, if and only if T is a-invariant and H contains a regular semisimple element of T 
with non-zero trace, i.e., H D T sreg ^ where the superscript "sreg" restricts to regular 
semisimple elements with non-zero trace. 

Remark 3.8. The twist in this definition, compared with the one in j2TJ or [J], is that 
we insist on having non-zero trace. This will be helpful later on, as it will eliminate a 
whole subcase in the key estimate (the proof of Proposition 13 . 12j) . and lead to a shorter 
proof, with better explicit constants. However, this restriction is not really essential in 
the greater scheme of things, and it would probably not be a good idea to do something 
similar for more general groups. 

The alternative = SL2(F p ) in Helfgott's growth theorem will be obtained as a 
corollary of the Gowers-Nikolov-Pyber "quasi-random groups" argument (see [10] and |19j). 



Proposition 3.9. For a prime p ^ 3, if a subset H C SL 2 (F P ) satisfies 

\H\^2\SL 2 (F P )\ 8/9 , 

we have = SL 2 (F P ). 

For a proof, see, e.g., [15j §4-5]. 

3.2. Escape from subvarieties and non-concentration lemmas. Two important 
tools in the proof of growth theorems for linear groups are estimates for escape from 
subvarieties and for non-concentration in subvarieties. We state and prove in this section 
the special cases which we need for the explicit proof of Helfgott's Theorem. The reader 
may wish to look only at the statements and skip afterwards to the next section to see 
how they are used. 

Lemma 3.10 (Escape). Let p ^ 7 be a prime number and let H C SL 2 (F P ) be a sym- 
metric generating set with 1 G H . Then Hsrlg ^ 0, i-e., the three-fold product set 
contains a regular semisimple element x with non-zero traced In particular, there exists 
a torus T = Cg(i) involved with . 

The general non-concentration inequalities are now often called "Larsen-Pink inequal- 
ities", since the first versions appeared in the work of Larsen and Pink [17] on finite 
subgroups of linear groups. "Approximate" versions occur in the work of Hrushovski [12] 
and Breuillard-Green-Tao [4], with closely related results found in that of Pyber and 
Szabo [21] • 

Theorem 3.11 (Non-concentration inequality). Let p ^ 3 be a prime number and let 
g G SL 2 (F P ) = G be a regular semisimple element with non-zero trace. Let Cl(g) C 
SL 2 (F p ) = G be the conjugacy class of g. If H C G is a symmetric generating set 
containing 1, we have 

(3.3) |C%)nff| ^ 7a 2/3 |#| 2/3 
where a = trp(_ff) is the tripling constant of H , unless 

(3.4) a > {HI 1 / 28 . 

From this last fact, we will deduce the following dichotomy, which is the precise tool 
used in the next section to prove Helfgott's Theorem. 

Proposition 3.12 (Involving dichotomy). (1) For all prime number p, all subsets H C 
SL 2 (F p ) and all maximal tori T C SL 2 (F p ), ifT and H are not involved, we have 

\H(1T\ 4. 

(2) If p ^ 3 and H C SL 2 (F P ) = G is a symmetric generating set containing 1, we 
have 

(3.5) |T reg n H {2) \ ^ U~ l a- A \H\ 1/3 

for any maximal torus T C SL 2 (F p ) which is involved with H, where a = tip(H), unless 

(3.6) a ^ \H\ l/l68 . 



The condition p J? 7 is sharp, see [15, Example 4.6.13] for an example. 
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Proof. (1) is obvious, since |T — T reg | ^ 2 and there are also at most two elements of 
trace in T (as one can check quickly). 

For (2), we apply the orbit-stabilizer theorem. Let T = T R G be a maximal torus in 
G. Fixing any g G T reg , we have T = Cc(g), the stabilizer of g in G for its conjugacy 
action on itself. We find that 

(3.7) |Tn# (2) |^ 





H 




\{hgh-i | 


h G H}\ 



for any symmetric subset H. Since H is involved with T, we can select g in T sreg fl if in 
this inequality, and the denominator on the right becomes 

{{hghT 1 | heH}\^ |# (3) n C%)| ^ |# {3) n Cl{g)\ 

where Cl(g) is the conjugacy class of g in G. Applying the Larsen-Pink inequality to 
H( 3 \ with tripling constant bounded by a 6 (by Ruzsa's Lemma), we obtain the lower 
bound 

unless a = trp(ff) ^ |if I 1 / 168 . In the first case, we get 

\T reg nH {2) \ ^ u~ l or A \H\ l/ *, 

unless 

7- 1 oT 4 |#| 1 / 3 ^ 2 

since there are only two elements of T n H ^ which are not regular. This last alternative 
gives 

a > ±|#| 1/12 

which we see is a stronger conclusion than ( 13. 6 p (precisely, it is strictly stronger if \H\ > 
2 13 , but in the other case the lower bound trp(if) ^ v2 from Lemma 13.21 is already a 
better result.) Hence Proposition 13.121 is proved. □ 

Now we prove the escape and non-concentration results. 

Proof of Lemma \3.1(A The basic point that allows us to give a quick proof is that the set 
iV C SL2(F P ) of elements which are not regular semisimple is invariant under SL2(F P )- 
conjugation, and is the set of all matrices with trace equal to 2 or —2. It is precisely the 
union of the two central elements ±1 and the four conjugacy classes of 

u= (o i)' ^("o 1 -i)' M '=(o i)' ^('o 1 -i 

(where e G F^ is a fixed non-square) while elements of trace are the conjugates of 

(0 1 

(these are standard facts, which can be checked on the list of conjugacy classes in [3 p. 
70], for instance.) 

We next note that, if the statement of the lemma fails for a given H, it also fails for 
every conjugates of H, and that this allows us to normalize at least one element to a 
specific representative of its conjugacy class. It is convenient to argue by contradiction, 

(3) 

though this is somewhat cosmetic. So we assume that HnAg is empty and p ^ 7, and will 
derive a contradiction. 



We distinguish two cases. In the first case, we assume that H contains one element of 
trace ±2 which is not ±1. The observation above shows that we can assume that one of 
u, v, u', v' is in H. We deal first with the case u G H . 

Since if is a symmetric generating set, it must contain some element 



with c 7^ 0, since otherwise, all elements of H would be upper-triangular, and H would 
not generate SL 2 (F P ). Then contains ug, u 2 g, u~ 1 g, u~ 2 g, which have traces, re- 
spectively, equal to Tr(g) + c, Tr(g) + 2c, Tr(g) — c, Tr(g) — 2c. Since c 7^ 0, and p is not 
2 or 3, we see that these traces are distinct, and since there are 4 of them, one at least is 
not in {—2,0,2}, which contradicts our assumption. 



If v! G H, the argument is almost identical. If v! (or similarly v') is in H, the set of 
traces of (u') j g for j G {-2, -1, 0, 1, 2} is 



and one can check that for p ^ 5, one of these is not 0, —2 or 2, although some could 
coincide (for instance, if Tr(g) = 2, the other traces are {2 + 2c, —2 — c, —2 + c, 2 — 2c}, 
and if c - 2 = 2, we get traces {2, -6, 10}, but -6 £ {0, 2, -2} for p ^ 5). 

In the second case, all elements of H except ±1 have trace 0. We split in two subcases, 
but depending on properties of F p . 

The first one is when —1 is not a square in F p . Conjugating again, we can assume that 
g G H. Because H generates SL 2 (F P ), there exists g G H which is not ±1, ±g - If 



is such an element, we have a 7^ 0, since otherwise b = — c 1 and the trace of g^g is c+c 1 , 
which is not in {—2,0,2} (non-zero because —1 is not a square in our first subcase), so 
HnXg ^ 0, contrary to the assumption. Moreover, we can find g as above with b 7^ c: 
otherwise, it would follow that H is contained in the normalizer of a non-split maximal 
torus, again contradicting the assumption that H is a generating set. 
Now we argue with g as above (i.e., a 7^ 0, b 7^ c). We have 



with non-zero trace t — c—b. Moreover, if t — 2 , i.e., c = 6+2, the condition det(g g) = 1 
implies 



or (6 + l) 2 = —a 2 . Similarly, if t = —2, we get (b — l) 2 = —a 2 . Since a 7^ 0, it follows 
in both cases that —1 is a square in F p , which contradicts our assumption in the first 
subcase. 

Now we come to the second subcase when — 1 = z 2 is a square in F p . We can then 
diagonalize go over F p , and conjugating again, this means we can assume that H contains 




{Tr(<?) + 2c, - Tr(<?) - c, Tr(g), - Tr(g) + c, Tr(g) - 2c}, 





_26 - b 2 - a 2 = 1 




as well as some other matrix 
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g" 



(the values of a, b, c are not the same as before; we are still in the case when every 
element of H has trace except for ±1). 

Now the trace of g' Q g' G is 2za. But we can find g' with a ^ 0, since otherwise H 
would again not be a generating set, being contained in the normalizer of the diagonal 
(split) maximal torus, and so this trace is non-zero. 

The condition 2za = ±2 would give za = ±1, which leads to —a 2 = 1. But since 
1 = det(g') = —a 2 — be, we then get be = for all elements of H. Finally, if all elements 
of H satisfy 6 = 0, the set H would be contained in the subgroup of upper-triangular 
matrices. So we can find a matrix in H with b ^ 0, hence c = 0. Similarly, we can find 
another 

'a 

in H with c^O. Taking into account that z 2 = —1, computing the traces of g'g" and of 

g' g'g" gives 

be — 2, bcz 

respectively. If be = 2, the third trace (of an element in H^) is 2z {0,2,-2} since 
p / 2, and if be = 4, it is 4z ^ {0, 2, —2} since p ^ 5. And of course if be ^ {2, 4}, the 
first trace is already not in {—2, 0, 2}. So we are done... □ 

For the proof of Theorem 13.111 we will use the method suggested by Larsen and Pink 
at the beginning of [T7 S §4]. We consider the map 

Cl(j)xCl(j)xCl( 5 ) — > G x G 

(xi,x 2 ,x 3 ) ^ (xiX 2 ,XiX 3 ) 

and we note that for (x 1 ,x 2 ,x 3 ) G (Cl(g) fl H) 3 , we have (j)(xi,x 2 ,x 3 ) G x H^ 2 \ We 
then hope that the fibers _1 (?/i,y 2 ) of <fi are all finite with size bounded independently 
of (2/1, 2/2) G G x G, say of size at most c\ ^ 1. The reason behind this hope is that 
Cl(#) 3 and G 2 have the same dimension, and hence unless something special happens, 
we would expect the fibers to have dimension 0, which corresponds to having fibers of 
bounded size since everything is defined using polynomial equations. 

If this hope turns out to be justified, we can count | Cl(g) fl H \ by summing according 
to the values of 0: denoting Z = (Cl(g) n Hf and W = <j>{Z) = <f)((C\(g) n H) 3 ), we 
have 

\ci( g )nH\ 3 = \z\= \r\yuy2)nz\ 

(yi,V2)£W 

which - under our optimistic assumption - leads to the estimate 

I Cl(g) n H\ 3 ^ Cl \W\ < Cl |# {2) | 2 < c ia 2 |#|, 

which has the form we want. 

To implement this - and solve the complications that arise -, we are led to analyze 
the fibers of the map 4>. The resulting computations were explained to the author by R. 
Pink, and start with an easy observation: 

Lemma 3.13. Let k be any field, and let G = SL 2 (fc). Let C C G be a conjugacy class, 
and define 

C 3 — > G 2 

(xi,X 2 ,X 3 ) ' y (XiX 2 ,XiX 3 ) 

Then for any (yi,y 2 ) G G x G , we have a bijection 

GV\y x G- 1 V\y 2 G- x — > ^\y Xj y 2 ) 

x x (x 1 ,x 1 1 yi,x 1 1 y 2 ) 
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In particular, if k = F p and C is a regular semisimple conjugacy class, we have a 
bijection 

_1 (2/1,2/2) ^cn yi cn y2 c. 

Proof. Taking xi as a parameter, any (xi, x 2 , x 3 ) with 0(xi, x 2 , x 3 ) = (yi, y 2 ) can certainly 
be written (xi, x± yi, x± 1 yi)- Conversely, such an element in SL 2 (/c) 3 really belongs to 
C 3 (hence to the fiber) if and only if x\ G C, a^f J/i G C, x^ l y 2 G C, i.e., if and only if 
X\ G C PI y\C~ l PI y 2 C~ l , which proves the first part. 

For the second part, we need only notice that if C is a regular semisimple conjugacy 
class, say that of g, then C = C~ l because g~ x has the same characteristic polynomial 
as g, hence is conjugate to g. □ 

We are now led to determine when an intersection of the form C D y\C fl y 2 C can be 
infinite. The answer is as follows, and it is one place where the use of the infinite group 
SL 2 (F P ) is significant: 

Lemma 3.14 (Pink). Let k be an algebraically closed field of characteristic not equal to 
2, and let g G SL 2 (/c) be a regular semisimple element, C the conjugacy class of g. Fory\, 
y 2 G G, the intersection X = C fl y\C fl y 2 C is finite, containing at most two elements, 
unless one of the following cases holds: 

(1) We have y 1 — 1, or y 2 — 1 or y x = y 2 . 

(2) There exists a conjugate B = xBqx -1 of the subgroup 



B ° = {(o „-i)}cSL,<*) 



and an element t G B fl C such that 

(3.8) yi ,y 2 eUUt 2 U 

where 

U = xU x-\ U = {(J jj}cB„. 

In that case, we have X C C fl B. 
(3) The trace of g is 0. 

The proof will be given at the end of this section: it is mostly computational. Before 
coming back to the proof of Theorem 13 .111 we state and prove another preliminary lemma, 
which is another case of non-concentration inequalities. 

Lemma 3.15. For a prime p and 7 G F£ , define 



C i ~ \ ( 1 ) \ teF P 



For any p ^ 3, any 7 G F* ; any x G SL 2 (F p ) and any symmetric generating set H of 
SL 2 (F P ) containing 1, we have 



\H fl xCjX 



7 t 
7- 



#naH [ ; . _! I t G F p 



^ 2a 2 \H\ 1/3 



where a = trp(if). 



Proof. We first deal with the fact that x and 7 are not necessarily in SL 2 (F p ). We 
have xC~{X~ x fl SL 2 (F P ) C xHqx^ 1 fl SL 2 (F P ), and there are three possibilities for the 
latter: either xBox -1 fl SL 2 (F P ) = 1, or xHqx^ 1 fl SL 2 (F P ) = T is a non-split maximal 
torus of SL 2 (Fp), or xB x _1 fl SL 2 (F P ) = B is an SL 2 (F p )-conjugate of the group B = 
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B CI SL 2 (F p ) of upper-triangular matrices (this is once more a standard property of 
linear algebraic groups over finite fields; the most direct argument in this special case is 
probably to observe that we only need to know that xHqX^ 1 fl SL 2 (F p ) is a subset of a 
maximal torus, or of a conjugate of B, which follows from the fact that this intersection 
is a solvable subgroup of SL 2 (F P )). 

In the last case, we can assume that x G SL 2 (F P ) and 7 G F p . In the first, of course, 
there is nothing to do. And as for the second, note that 7 and 7 _1 are the eigenvalues 
of any element in SL 2 (F P ) fl xC^x -1 , and there are at most two elements in a maximal 
torus with given eigenvalues. A fortiori, we have I if n xC^x- x \ ^ 2 < 2a 2 \H I 1 / 3 in that 
case. 

Thus we are left with the situation where x G SL 2 (F P ). Using SL 2 (F p )-conjugation, it 
is enough to deal with the case x — 1. Then either the intersection is empty (and the 
result is true) or we can fix 

So = (2 7 -i) e H n C v 
and observe that for any g G if fl C 7 , we have 

hence 

|ff nc 7 | ^ |ff (2) nd| = |ff (2) nu |, 

which reduces further to the case 7 = 1. 

In that case we have another case of the Larsen-Pink non-concentration inequality, in 
that case in a one- dimensional variety. There is here also a rather short proof: we fix any 
element h G if such that h is not in B , i.e. 



h = 



c d 



ij : 



with c ^ 0. This element exists, because otherwise if C B fl SL 2 (F P ) would not be a 
generating set of SL 2 (F P ). 

Now consider the multiplication map 

U* x U* x U* — > G 

(u l7 u 2 ,u 3 ) 1 y u x hu 2 hr 1 u ?> 

where U* = U — 1 (we explain below why we do not use Uq as domain). 

Note that since h G if, we have i/>((U* fl if( 2 )) 3 ) C H^ 8 \ Crucially, we claim that for 
any x G G, the fiber ifj~ 1 (x) is either empty or reduced to a single element! If this is 
true, we get as before 

|U*nff( 2 f^ \H®\ ^a 6 \H\, 

and therefore 

|u n ff (2) | = |u* n ff (2) | + 1 ^ 2« 2 |ff| 1/3 , 

which is the result. 

To check the claim, we compute. Precisely, if 



a matrix multiplication leads to 



/l - tit 2 c 2 - t 2 ac * 

V>(«l,ti2,«3) = ( _^ c 2 ^ 



13 



and in order for this to be a fixed matrix x, we see that t 2 (i.e., u 2 ) is uniquely determined 
(since c ^ 0). Since u 2 is in U*, it is not 1, and this means that t 2 7^ (ensuring this is 
the reason that ip is defined using U* instead of U ). Thus ti (i.e. U\) is also uniquely 
determined, and finally 

u 3 = {u\hu 2 hT )~ 1 x 

is uniquely determined. □ 

Proof of Theorem \3 . 1 1\ We have g regular semisimple with Tr(g) ^ 0. We define as 
above the map 4> and denote 

z = (c\(g) n H)\ w = <j>{z) = 0((c%) n h) 3 ), 

so that 

(3.9) \c\( g )nH\ 3 = \r 1 (yuy2)nz\ = s + s 1 + s 2 , 

(yi,V2)eW 

where Si denotes the sum restricted to a subset Wi C W, Wo being the subset where the 
fiber has order at most 2, while Wi, W 2 correspond to those (yi,y 2 ) where cases (1) and 
(2) of Lemma 13.141 hold. Precisely, we do not put into W 2 the (yi,y 2 ) for which both 
cases (1) and (2) are valid, e.g., y\ = 1, and we add to W\ the cases where y\ = —1, 
which may otherwise appear in Case (2). We will prove: 

S ^ 2\H^\ 2 <: 2a 2 \H\ 2 , S t ^ 4\H^\ 2 ^ 4a 2 \H\ 2 , 

S 2 ^ 32a u ^\H\ 5 / 3 . 

Assuming this, we get immediately 

| C%) n H\ ^ Q 2 / 3 a 2 ' 3 \H\ 2 / 3 + 2^ 3 a 3 ^ 9 \H\ 5 / g 

from (13.91) . Now either the second term is smaller than the first, and we get (13. 3p (since 
2-6 2 / 3 <7),or 

2 5/3 a 34/9| ff |5/9 > Q^/S^ |2/3 > 2 ^ a ^\ H \ 2 / 3 , 

which gives 



a > 



m l/2S 



the second alternative (13. 4p of Theorem 13.111 which is therefore proved. 

We now check the bounds on Si. The case of So follows by the fact that the fibers over 
W have at most two elements, hence also their intersection with Z, and that \W \ ^ 
\W\ ^ \H^\ 2 . 

The case of Si splits into four almost identical subcases, corresponding to y\ = 1, 
j/i = — 1 (remember that we added this, borrowing it from Case (2)), y 2 = 1 or y\ = y 2 . 
We deal only with the first, say Si^: we have 

s hi < \r 1 (hv2)nz\. 

But using Lemma [3.131 we have 

|0- 1 (l,y 2 )nZ| = \{( Xl ,x^,x^y 2 ) e (C\(g)nH) 3 }\ ^ \H®\ 

for any given y 2 G , since x± 6 H determines the triple {x\,x 1 1 ,x 1 1 y 2 ). Therefore 

^ \H^\\H\ < \H^\ 2 , 

and similarly for the other three cases. 

Now for 5*2- Here also we sum over y± first, which is ^ ±1 (by our definition of W 2 ). 
The crucial point is then that an element y\ ^ ±1 is included in at most two conjugates 
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of B . Hence, up to a factor 2, the choice of y\ fixes that of the relevant conjugate B for 
which Case (2) applies. Next we observe that Cb = Cl(g) flB is a conjugate of the union 

C a U C a -i, 

where, as in Lemma [3.151 we define 

C ' a = {(o a" 1 ) 1 tGF 4' 
and a is such that a + a" 1 = Tr(y). Given y\ G and B containing yi, we have 

by m> 

y 2 e (H (2) n U) u (F (2) n t 2 U) 

for some t G Cb. We note that t 2 XJ is itself conjugate to C a 2 or C a -2. 

Then the size of the fiber (f)~ 1 (yx,y 2 ) D Z is determined by the number of possibilities 
for x\. As the latter satisfies 

xi G C B n H, 

we see that we must estimate the size of intersections of the type 

h n c 7 , h {2) n c 7 

for some fixed 7 G F* , as this will lead us to estimates for the number of possibilities for 
y 2 as well as x\. Using twice Lemma [3.151 we get 

\{V2 I (yi^^G^KStrp^fl^l^^ga 25 / 3 !^ 1 / 3 , 

(the factor 8 accounts for the two possible choices of B and the two "components" for 
2/2; and the factor 2 in the lemma) and 

\<tr\vi,V*)nZ\ ^Aa 2 \H\ l '\ 

This gives 

S 2 ^ 32a 31 / 3 \H\ 2 ^\H^\ ^ 32a 34 / 3 1#| 5 / 3 , 
as claimed. □ 

There now only remains to prove Lemma 13.141 

Proof of Lemma \3.14\ It will be convenient to compute the intersection C Hy^ l C HyJ C 
instead of C D y\C fl y 2 C, a change of notation whichs is innocuous. 

The computation is then based on a list of simple checks. We can assume that the 
regular semisimple element g is 

'a 



where a 4 7^ 1, because a = ±1 implies that g is not regular semisimple, and a a fourth 
root of unity implies that Tr(g) = 0, which is the third case of the lemma (recall that k 
is assumed to be algebraically closed). Thus the conjugacy class C is the set of matrices 
of trace equal to t = a + a^ 1 . 

The only trick involved is that, for any yi G SL 2 (/c) and x G SL 2 (A;), we have 

C n (xy^- 1 )-^ = x(x- l C n y^x^C) = x(C n y^CTjx- 1 

since x~ x C = Cx _1 , by definition of conjugacy classes. This means we can compute 
C H y± C, up to conjugation, by looking at C fl (yQ -1 C for any y[ in the conjugacy 
class of y\. In particular, of course, determining whether C fl y^C is infinite or not only 
depends on the conjugacy class of y\. 

The conjugacy classes in SL 2 (fc) are well-known. We will run through representatives 
of these classes in order, and determine the corresponding intersection C fl yf 1 C. Then, 
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to compute C fl C fl y 2 ( 1 C, we take an element x in C PI 2Af C, compute ?/2^, and 
C fl yjf C n corresponds to those x for which the trace of y 2 x is also equal to t. 
We assume y\ ^ ±1. Then we distinguish four cases: 



1 1\ /-l 1 



(3.10) y 1= ^ /3^±l,/3^a ±2 

/a 2 

We claim that D = C fl y^C is then given, respectively, by the sets containing all 
matrices of the following forms, parameterized by an element a G k (with a ^ in the 
third case): 

a a \ ( a -1 a 



(3-H) n «-i or 



(3.12) 



a~7 ^ a y 

a (-a 2 + at-l)/(2f)\ 
2i i - a y 



t (/3-a 2 )a 



/3+1 I -Oa- 1 t/3 



(3-i3) (V :) or (»; ° 

Let us check, for instance, the third and fourth cases (cases (1) and (2) are left as 
exercise), which we can do simultaneously, taking y\ as in (13.101) but without assuming 
^ a ±2 . For 

we compute 

' (3a (3b 



This matrix belongs to C if and only if [3a + (3~ l d = t = a + d. This means that (a, d) 
is a solution of the linear system 

a + d = t 
[3a + f3~ l d = t, 

of determinant (3~ l — [3 ^ 0, so that we have 

d * 



p + r (3 + i 

Write c = c'/(/3 + 1), b = b'/((3 + 1); then the condition on d and V to have det(x) = 1 
can be expressed as 

-c'b' = {(3-a 2 ){(3-a- 2 ). 

This means that either (3 is not one of a 2 , a -2 (the third case), and then c and d are 
non-zero, and we can parametrize the solutions as in A3. 1 2j) . or else (the fourth case) c or d 
must be zero, and then we get upper or lower-triangular matrices, as described in (I3.13p . 
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Now we intersect D (in the general case again) with y 2 1 C. We write 

= fxx x 2 \ 
m \x 3 x 4 J ■ 

We consider the first of our four possibilities now, so that x G D is upper-triangular 
with diagonal coefficients a, a -1 (as a set), see ( 13. lip . We compute the trace of y 2 x, and 
find that is 

ax3 + X\(x + x^pC , or ax^ + X\a~ + 0:4a. 

Thus, if X3 7^ 0, there is at most one value of a for which the trace is t, i.e., D D y 2 x C 
has at most two elements (one for each form of the diagonal). If X3 = 0, we find that x± 
is a solution of 

axi + a = t, 

or 

ax^ 1 + oT x X\ = t, 

for which the solutions are among 1, a 2 and a~ 2 , so that y 2 is upper-triangular with 
diagonal coefficients (1, 1), (a 2 , a~ 2 ) or (a~ 2 , a 2 ), and this is one of the instances of Case 
(2) of Lemma EIH 

Let us now consider the second of our four cases, leaving this time the third and fourth 
to the reader. Thus we take x as in (13.121) . and compute the trace of y 2 x as a function of 
a, which gives 

Ti(y 2 x) = — ^|a 2 + (xi - x 4 + y) a + ( x 4 + 2x 2 )t. 

The equation Ti(y 2 x) = t has therefore at most two solutions, unless x% = and 
X4 = Xi. In that case we have x^ = 1, and the constant term is equal to t if and only if 
X4 — 1 and x 2 = (so y 2 = 1) or 24 = ±1 and x 2 = 1 (and then y 2 = y\). Each of these 
possibilities corresponds to the exceptional situation of Case (1) of Lemma [3.141 

All in all, going through the remaining situations, we finish the proof. □ 

3.3. Proof of Helfgott's Theorem. We now prove Theorem 11.21 If p ^ 5, one checks 
numerically that trivial bounds already imply the theorem. So we assume that p ^ 7, 
which means that Lemma [3. 101 is applicable. We will show that 

(3.14) trp(H) ^ 2- 1 / 2 |#| 1 / 1512 

for p ^ 7, unless = SL 2 (F P ), where the latter case will arise by applying Proposi- 
tion 13.91 Then using Lemma 13. 2\ we derive 

trp(#) > max(2 1 / 2 ,2- 1 / 2 |iJ| 1 / 1512 ) > \H\ l ' ?M2 \ 

which is the precise form of Helfgott's Theorem we claimed. 

We define H = H^ 2 \ so that (by Lemma 13. l(jp there exists at least one maximal torus 
T involved with hence a fortiori involved with L = = H^. 

If, among all maximal tori involved with L, there is one for which the lower bound ( 13. 5 p 
(applied to H = L) fails, we obtain from Proposition 13.121 the lower bound 

trp(L) > (LI 1 / 168 > \H\^ 168 , 

and since trp(L) ^ a 9 by Ruzsa's Lemma, we get 

(3.15) a^\H\ 1 / 1512 ^2- 1 / 2 \H\ 1 / 1512 , 

which is (pTH]) . 

Otherwise, we distinguish two cases. 
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Case (1). There exists a maximal torus T involved with L such that, for any g G G, 
the torus gTg^ 1 is involved with L. 

As we can guess from (I3.5P and ( 13. 2p . in that case, the set L will tend to be rather 
large, so \L\ is close to |G|, unless the tripling constant is itself large enough. 

Precisely, writing T = T n G, we note that the maximal tori 

gTg- 1 = (gTg- 1 ) n G 

are distinct for g taken among representatives of G/Nq{T). Then we have the inequalities 

9 eG/7V G (T) 1 Gl Jl 

where /3 = trp(L), since each gTg^ 1 is involved with L and distinct regular semisimple 
elements lie in distinct maximal tori (and we are in a case where (I3.5P holds for all tori 
involved with L). 

Now we unwind this inequality in terms of H and a = tip(H). We have = H^ 8 \ 

so 

\H\ > a~ 6 \L^\ > 14 _1 a;~ 6 /3~ 4 (p — 1) 2 |L| 1//3 > 14" V" 6 /?" 4 ^ - l) 2 ^ 1 / 3 
by Ruzsa's Lemma. Furthermore, we have 

P = trp(L) = trp(# (4) ) ^ a 10 
by Ruzsa's Lemma again, and hence the inequality gives the bound 

\H\^U-V 2 a- m (p-l)\ 
which for p ^ 5 implies \H\ i3 100 _1 a;~ 69 |G|. But then either 
(3.16) trp(tf) = a ^ 20(T 1 / 69 |G'| 1 / 621 ^ 1 ^621 ? 

or else 

|#| ^ 2|G| 8 / 9 , 

which (via Proposition 13.91) are versions of the two alternatives we are seeking (in partic- 
ular the first implies (I3.14p .) 

Case (2). Since we know that some torus is involved with L, the complementary situ- 
ation to Case (1) is that there exists a maximal torus T involved with L = and a 
conjugate gTg -1 , for some g G G, which is not involved with L. We are then going to 
get growth using Lemma 13.51 There is a first clever observation (the idea of which goes 
back to work of Glibichuk and Konyagin [9] on the "sum-product phenomenon"): one 
can assume, possibly after changing T and g, that g is in H. 

Indeed, to check this claim, we start with T and h as above. Since if is a generating 
set, we can write 

g = hf-h m 

for some m ^ 1 and some elements hi G H. Now let i ^ m be the smallest index such 
that the maximal torus 

T' = {h i+ i ■ ■ ■ h m )T{h i+ i ■ ■ ■ h m ) 1 

is involved with L. Taking i = m means that T is involved with L, which is the case, 
and therefore the index % exists. Moreover % ^ 0, again by definition. It follows that 

(hihi + x • ■ ■ h m )T(hihi + i ■ ■ • h m ) 1 

is not involved with L. But this means that we can replace (T, g) with (T', hi), and since 
hi G H, this gives us the claim. 
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We now write h for the conjugator g such that L and the torus S = gTg^ 1 = hTh^ 1 
are not involved. Apply Lemma 13.51 with (H, K) = (H, hTh^ 1 R G) and n = 5. This 
gives 

|#(6)| \H(^nS\ 



\H\ \HWr)S\ 

But since L = and S are not involved (by construction), we have fl S\ ^2, 
by the easy part of the Key Proposition I3.12[ and therefore 

^%V 5) nS|. 

\H\ 2 1 1 
However, L and T are involved, and moreover 

h{H {s) n T)/^ 1 c h {10) n s = # (5) n s, 

so that 

|# (5) n s| > |# (8) n T| = |L (2) n T| ^ u^cr 4 )^ 1 / 3 

where a = trp(L), by the Key Proposition 13.121 (again, because (13.51) holds for all tori 
involved with L). 
Thus 



which translates to 



^28- 1 &- 4 \H\ 1 / 3 , 
\H\ 



a 10 \H\ > 28- 1 a- 36 \H\V\ 



by Ruzsa's Lemma. This is a rather stronger bound for a than before, namely 

(3.17) a = trp(#) > 28~ 1/46 |#| 1/138 ^ 2- 1 / 2 |#| 1 / 138 . 

To summarize, we have obtained three possible lower bounds of the right kind for a, 
namely (1315]) . (13161) and (l3TT7j) . one of which holds if ^ SL 2 (F P ). All imply <K1M . 
and hence we are done. 

3.4. Diameter bound. Corollary 11.31 is a well-known consequence of the growth theo- 
rem: by induction on j ^ 1, we see using Helfgott's Theorem that given a symmetric 
generating set S C G = SL 2 (F p ), either diamC(G, S) ^ 3 J , or 

\H {33) \ > \H\ {1+S)J 

where H = S U { 1 } . For 

"log log \ G 

J 



log(l + 6) 

the second alternative is impossible, and hence 

diame(G,S) ^ 3 j ^ 3(log |G|) (log3)/log(1+<5) , 
which gives the result since (log3)/log(l + 1/3024) ^ 3323. 

4. The Bourgain-Gamburd method 

The method of Bourgain and Gamburd [I] leads, from Helfgott's growth theorem, to a 
proof that the Cayley graphs modulo primes of a Zariski-dense subgroup of SL 2 (Z) form 
an expander family. Applying this method straightforwardly with explicit estimates (as 
done in [151 Ch. 4]), one obtains explicit expansion bounds (either for the spectral gap 
of the combinatorial Laplace operator, or for the discrete Cheeger constant). However, 
these constants are typically very small. 
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4.1. The L 2 -flattening inequality. This section applies - in principle - to all finite 
groups, and the basic expansion criterion that we derive (Corollary I4.4[ following essen- 
tially Bourgain and Gamburd) is also of independent interest. 

In rough outline - and probabilistic language -, the idea is to show that if two in- 
dependent SL 2 (F p )-valued symmetrically distributed random variables X\ and X 2 are 
not too concentrated, but also not very uniformly distributed on SL^Fp), then their 
product X1X2 will be significantly more uniformly distributed, unless there are obvious 
reasons why this should fail to hold. These exceptional possibilities can then be handled 
separately. 

Applying this to some suitable step X^ of the random walk (where the initial condition 
is obtained by different means), this result leads to successive great improvements of the 
uniformity of the distribution for X 2 k, X^, . . . , X 2 3k, until the assumptions of the lemma 
fail. In that situation, the index m = 2^k is of size about log and P(X2 m = 1) gives a 
suitable upper-bound on the number of cycles to obtain expansion, by a variant of what 
might be called the Huxley- Sarnak-Xue method (see [13] and [22]), as we now recall. 

Remark 4.1. In an earlier draft, we had claimed a much better bound (roughly exponen- 
tially better) by using non-dyadic steps, but this was due to a bad mistake which was 
pointed out by the referee, which we heartily thank once more. 

For a finite group G, we denote my d(G) the minimal dimension of a non-trivial ir- 
reducible unitary representation of G. Moreover, if X is a G-valued symmetrically- 
distributed random variable, we define the return probability rp(X) by 

rppQ = P(XxX 2 = 1), 

where (X 1 ,X 2 ) are independent random variables with the same distribution as X, or 
equivalently 

rp(X) = ^P(X = ^) 2 . 

geG 

Let S be a symmetric generating subset of G and T = G(G, S) the associated Cayley 
graph. The Markov operator M acts on functions on G by 

M<p{g) = t^Y,v(9s), 

and it is a self-adjoint operator. The spectral gap of G, as we normalize it, is equal to 
1 — f?p , where g^ is the largest eigenvalue of M, and it is therefore ^ 1 — gr, where gr is 
the spectral radius of M. 

By expressing spectrally the number of closed walks of length 2m from the origin in 
T, and relating the latter with the return probability rp(X m ), where (X m ) is the random 
walk on the graph governed by M, one gets 

Q 

Using positivity and the fact that G acts without invariant vector on the £>r-th eigenspace 
of M, it follows that 

d(G)g 2 r m < prp(X m ), 
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or in other words, we have a bound for the spectral radius in terms of the return proba- 
bility: for any m ^ 1, we have 

/ \G\ \V(2m) 

(4-1) 1-Ai(r)<er< (j^rp(X m )J 

We consider now two independent (not necessarily identically-distributed) G-valued 
random variables Xi, X 2 and let 

rp + (X 1; X 2 ) = max(ip(Xi),ippf 2 )). 

We attempt to bound rp(XiX 2 ) in terms of rp + (X 1; X 2 ). To do this while still remain- 
ing at a level of great generality, the following definition will be useful: 

Definition 4.2 (Flourishing). For 5 > 0, a finite group G is 5-flourishing if any symmetric 
subset H C G, containing 1, which generates G and has tripling constant trp(if) < \H \ 5 
satisfies # (3) = G. 

In particular, Theorem 11.21 states that all groups SL 2 (F p ), for p prime, are 1/3024- 
flourishing. 

We will prove a general L 2 -flattening theorem, which may be of general interest. In 
order to somehow streamline the proof, we do not explicitly describe here what U G large 
enough" means. However, all relevant steps where a condition on the size of G occurs are 
clearly marked, and in the second part of Section I4.3[ we will look back to express these 
as explicit inequalities. 

Theorem 4.3 (L 2 -flattening conditions). Let G be a finite group which is 5- flourishing 
for some 5 with < S ^ 1. Let X\, X 2 be symmetric independent G -valued random 
variables. 

Let < 7 < 1 be given, and assume that 

(4.2) P(Xi e xH) <: \G\-*> 

for all proper subgroups H C G and all x G G. 

Then for any e > 0, there exists 5\ > and c 3 > 0, depending only on e, 5 and 7, such 
that 

(4.3) rp(X,X 2 ) < c 3 max{ ] J IZ; , rp+ g | '; i X2) } 

when \G\ is large enough in terms 0/(5,5,7). 
More precisely, one may take 

(4.4) Si = - minf — — — , — 

v ; 2 V2c 2 + 1 2c 2 

where c 2 = 937 is as in Theorem \2.1\ and 
Proof. By definition, we have 



c 3 < 2 u ci <: 2 2438 . 



rp(X 1 X 2 ) = ^P(X 1 X 2 = o) 2 . 

geG 

We now decompose the ranges of the distribution functions 

Ui( X ) = P(X = x) 

into dyadic intervals. Consider a parameter I ^ 1, to be chosen later, and decompose 

[minP(X = x),maxP(X = x)} C [0, 1] = J U Ji U • • • U J/ 
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where 

']2" i - 1 ,2- i ], for0^i</ 
[0, 2- 1 ] for i = I. 

This gives two partitions of G in subsets 

A jti = {xeG | Vj{x) = P(Xj = x) e %}, 

for j = 1, 2. We note that 

(4.5) \A jti \ < 2* +1 

for j = 1, 2 and < i < J. 

Using this decomposition into the formula above, and the fact that 

P(X ± X 2 = g, Xi G A hI or X 2 G A 2J ) ^ P(X, G A v ) + P(X 2 G A 2jJ ) < M 
we obtain 

rp(X 1 X 2 ) = J]( E P(X X X 2 = G X 2 G A,,))' 

^8|Gf2- 2/ + 2^( P(XiX 2 = g,X 1 eA 1 „ X 2 G A 2 ^ 

geG 0^i,j<I 

^2 3 - 2I \G\ 3 + 2I 2 Yl ^P(X 1 X 2 = ^,X 1 GA lii , X 2 eA 2J ) 2 

0^i,j<I g&G 

by the Cauchy-Schwarz inequality. Furthermore, the inner sum (say, B(Ai ti , A 2 j)) in the 
second term is given by 



B(A 1:i ,A 2d ) = ^P(XxX 2 = g, X\ g A m , X 2 g A 2j -) 2 



sec 



= £( E p(^ = ^)p(^ 2 = y)) : 



^i(xi)i/i(x 2 )^ 2 (2/i)^ 2 (y 2 ) 



xiyi=X2V2 

2- 2t - 2 J\{(x 1 ,x 2 ,y 1 ,y 2 ) G x A^. | x lVl = x 2 y 2 }\ 
= 2- 2i - 2 iE(A 1 , i ,A 2 , j ) 

where E(A, B) denotes the multiplicative energy. 
Thus we have proved that 

(4.6) rp(X 1 X 2 )^2 3 - 2/ |G| 3 + 2/ 2 ]T 2~ 2 ^E(A hi , A 2J ). 

0^i,j<I 

We now want to get upper-bounds in terms of the return probability rp + (Xi, X 2 ). This 
is done in different ways, depending on the size of the subsets A\j, A 2 j. We recall first 
the "trivial" bounds 

(4.7) E(A,B) ^ min(|A| 2 |S|, |A||,B| 2 ). 
We claim that for all % and j, we have 

(4.8) 2- 2 ^E(A 1>h A 2 j) ^ 2 4 T P + (X 1 ,X 2 )e(A 1 „A 2J ), 
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and that, for all a ^ 1, we have 

(4.9) 2- 2 ^E(A hi , A 2jj ) *C a- 1 rp+(X 1 ,X 2 ) 

unless 

\ A l,i\ ^ 1 \ A 2,j\ ^ 1 



(4.10) ^ -^=, ^ _. 

v ; 2* 2y/a' 2^ 2V« 

To see (14.81) . we remark that 

rp + (X 1; X 2 ) > ^ (rp(X x ) + rp(X 2 )) = ^ (P(X X = ^) 2 + P(X 2 = 

| \A 2 ,j\\ lQA^A^) 1 /* 



2 V 2 2+2i 2 2+2 -?' / 4 2*+-?' 
for any choice of i and j. Hence we get 

2 -^E(A hi , A 2d ) = 2- 2 ^e(A 1 „ A 2J )(\A 14 \\A 2J \) 3 / 2 



^4rp + (X 1 ,X 2 )e(A lii ,A 2 „ 



^ 16rp + (X 1 ,X 2 )e(A M ,A 2j ) 

by (1431). 

As for fl4~9l . if we assume that 2" 2 ( i+ ^ J B(Ai ii , A 2)i ) > «" 1 rp + (Xi, X 2 ), then we write 
simply 

2- 2 (^)|A 1)i | 2 |A 2j -| ^ 2- 2 ^)E(A M ,A 2j ) ^ af 



-ll^2j| 



- 2 2 + 2 J 

using (14. 7p . and get the first inequality of (I4.10p . the second being obtained symmetrically. 
With these results, we now fix some parameter a ^ 1, and let 

Pa = {{ij) | ^ i,j < I, \A 1;i \ > T- l a- 1 and |A 2J | > 2^ 1 «- 1 }. 
For ^ P a , we have 

2~ 2 ^E(A hu A 2J ) ^ a- 2 rp + (X 1; X 2 ) 
by (14.91) and (I4.10p . and thus from (I4.6p . we have shown that 

rp(X!X 2 ) ^2 3 - 2/ |G| 3 + 2a- 2 rp+(X 1 ,X 2 )/ 4 + 32rp + (X 1 ,X 2 )/ 2 ^ e{A lji ,A 2jj ) 

(estimating the size of the complement of P a by I 2 ). We select 

/- < 3^01), 

and hence obtain 

rp(X 1 X 2 ) < ^ + 2 8 rp + (X 1 ,X 2 )(log3|G|) 2 { (1 ° g y )2 +2 £ e (A M , A 2J )}. 

We apply this bound with a = \G\ S °, where <5 > will be chosen later. Thus 
rp(X x X 2 ) ^ ^ + 2 8 rp + (^i,^ 2 )(log3|G|) 4 |Gr 25o + 

2 9 (log3|G|) 2 rp+(X 1 ,X 2 ) ^ e(A lti , A 2J ) . 
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Let then 

R a = G P a I e(Ax,i,A 3 j) > a' 1 } C P a , 

so that the contribution of those G P a which are not in R a , together with the middle 
term, can be bounded by 

2"(log3|G|) 4 ■ 

rp (Xi, X 2 ). 

We can now analyze the set R a ; it turns out to be very restricted when Sq is chosen small 
enough. By Theorem 12. 1\ for each G R a , there exists a /^-approximate subgroup 
Hjj and elements (xi,yj) G A\^ x A 2 j such that 

|H M -| < fa\A lyi \, \AxtC\Xittij\ > K x \A u l \A 2tj n H i>jyj \ ^ 0^1 A 3 J, 

and with tripling constant bounded by (3<±, where the are bounded qualitatively by 

for some absolute constants, which we take to be c\ = 2 2424 , c 2 = 937 using ( 12. 2p . We 
then note first that if Hij denotes the "ordinary" subgroup generated by Hjj, we have 

(4.11) P{Xt G Xi Hij) > P{X X G XiH itj ) > 

P(Xx G Ax,i n xMij) ^ P-^T > J7T- > ' 



f3 3 2*+! " 4/3 3 a " 4ci|G|( 1 + c 2)*' 
where we used the definition of P a . If Sq, is small enough that 
(4.12) (l + c 2 )5 <7, 

and if \G\ is large enough, this is not compatible with ( 14. 2p . and we can therefore assume 
that each Hjj (if any!) generates the group G. 

We next observe that Hj j can not be extremely small. Indeed, we have 

|H M -| ^ \xiH ij:j n A 1A \ ^ fe x \A Xt h 

on the one hand, and by applying (14. 2 p with H = 1, we can see that A X i is not too small, 
namely 

max 9 g G P(Ai = g) 2 l+1 Aa 

using again the definition of P a . 
This gives the lower bound 

(4-13) |H i7 -| ^ -R^- ^ -^IGI 71 

v ; 1 Jl 4a/3 3 4ci 1 

with 71 = 7 — <5o(l + C2) (which is > by (I4.12p ). and then leads to control of the tripling 
constant, namely 

(4.14) trp(Hij) < fa ^ ci\G\ C2So c 1 (4c 1 ) 25o ^ 1 |H ij -| C25 °^ 1 . 



Since we assumed that G is ^-flourishing, we see from Definition 14.21 that if 5 is such 
that 

(4.15) ^ = ^0 

7i 7-(l + c 2 )5o 

and again if |G| is large enough, the approximate subgroup Hjj must in fact be very large, 
specifically it must satisfy 
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and in particular 

Pi Ci 

Intuitively, this implies that X± and X 2 are already rather uniformly distributed over G, 
and hence that rp + (X 1; X 2 ) is already too small to be significantly improved at the level 
of XiX 2 . To express this idea concretely, we go back to the first stage of the argument, 
namely (14. 6ft : the contribution to rp(X!X 2 ) coming from (i, j) was bounded by 

MA\A 2 A 2 „ 1 



22(i+i) ^ 2*~ 3 
by (14.51) . But then we also have 

2 m > \A lti \ > ^ > f}r > c^\G\^\ 

P2 P2P4 

(observe that /3 2 /3 4 ^ Ci|G| C2<5 °) and therefore 

2- 2(i+j) E{A lti ,A 2J ) < 16 Cl |G|- 1+2c2<5t '. 

Using again the trivial bound I 2 ^ 9(log3|G|) 2 for the number of possible pairs 
to which this applies, the conclusion is an inequality 

(4 16) rofX X )< 1 + 2"r ^g 3 '^ 4 + oi3 (log 3|C|) 4 

(4.1bJ rp(AiA 2 ) ^ — + 2 ci | G |!_ C25o + 2 — r P (X 1 ,X 2 ), 

which holds (under the assumptions that |G| is sufficiently large) for all Sq small enough 
so that (14.121) and (I4.15P are satisfied. It is elementary that (I4.15P is stronger than (I4.12p 
and is equivalent with 

do< (l + S)c 2 + 6> 

which holds when 5q < 5 r y/(2c 2 + 1) (since we assume 5 ^ 1). 
Thus we can apply this for 

5o = minf — — — , ^ = 26\, 
V2c 2 + l'2c 2 y 

where 8\ is given by (14. 4p . Then for \G\ large enough, (I4.16P implies (14. 3p . and hence we 
have finished the proof of Theorem 14.31 □ 

We can summarize all this as follows (with the same remark as before concerning our 
handling of the conditions on the size of G): 

Corollary 4.4 (The Bourgain-Gamburd expansion criterion). Let c = (c, d, 8, 7) be a 

tuple of positive real numbers, and let S(c) be the family of all finite connected Cayley 
graphs Q(G, S) for which the following conditions hold: 

(1) We have d(G) > \G\ d ; 

(2) The group G is 5 -flourishing; 

(3) For the random walk (X n ) on G with X — 1, we have that 

P(X k e xH) ^ |G|~ 7 

for some k clog \G\ and all x G G and proper subgroups H C G. 

Then, for any T e 9(c) with \T\ large enough, the spectral gap of the normalized Laplace 
operator of T satisfies 

A.(T)>l-«p(-^), 
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where 

/2c 2 + 1 16c 2 \ 

Note that it is not clear at this point that this corollary is not an empty statement (or 
one that applies at most to finitely many graphs with a bounded valency). But in the 
next section we will check that it applies to the situation of Theorem 11.11 to prove that 
certains families of Cayley graphs are expanders. 

Proof. Let V = C(G, S) be a graph in S(c). We will apply Theorem 14.31 with e = d/2 so 
that 

r 1 • ( h d \ 

Oi = - mm , 

2 V2c 2 + l'4c 2 / 

When \G\ is large enough, we can rephrase the conclusion using the simpler inequality 

(4.17) rp(Y 1 y 2 ) ^camax^pp^, j ^ max ^___ _____ 

for random variables Yi, y 2 which satisfy the assumptions of this theorem. 

Let k = |_clog|G|J be given by (3). We apply the theorem to Y\ = X 2Jfc and Y 2 = 
X 2 (j+i) k Y-^ 1 for j ^ 0. These are indeed independent and symmetric random variables, 
and Conditions (2) and (3) imply that we can indeed apply Theorem 14. 3 1 to these random 
variables for any j ^ 2. Since Y\ and Y" 2 are identically distributed, we have 

rp+(y 1 ,y 2 )=rp(y 1 ) = rp(X 2JA; ). 

Thus, applying the theorem, we obtain by induction 

rp(X 2Jfc ) ^ rp(X fc )|G|-^/ 2 ^ |G|-^ 2 

when j is such that 

| G |l-M/4 > 

and for larger j, we get 

rp(X 2Jfc ) < |GT 1+M / 4 . 
In particular, we obtain this last inequality for 



J 



2(1 - 3d/ 4) 



4 /2C2 + 1 4c 2 



Si 

which, by the "cycle-counting" inequality (14. ip . gives 

^^(|G| 1 - d rp(X ife ))^)^exp(-^) ) 
which thus proves the theorem. □ 



4.2. Expansion bounds for SL 2 . Theorem 11.11 will now be proved by applying the 
criterion of Corollary 14.41 Thus we will consider the groups G p = SL 2 (F P ) for p prime, 
for which Condition (1) of the Bourgain-Gamburd criterion (which is purely a group- 
theoretic property) is given by 

d(SU(F p )) = ^ 

for p ^ 3 (a result of Frobenius), which gives a value of d arbitrarily close to 1/3, for p 
large enough. Condition (2) is given by Helfgott's Theorem, with 5 = 1/3024. Note that 
it is purely a property of the groups SL 2 (F P ). 

Condition (3), on the other hand, depends on the choice of generating sets. The 
symmetric generating sets S p in Theorem 11.11 are assumed to be obtained by reduction 
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modulo p of a fixed symmetric subset S C SL 2 (Z). We will argue first under the additional 
assumption that S C SL 2 (Z) generates a free group. 

We begin with a classical proposition, whose idea goes back to Margulis. For the 
statement, recall that the norm of a matrix g e GL„(C) is defined by 

ii H \(gv,w)\ 

\\g\\ = max — — — — - 
v,w^=o \\v\\\\w\\ 

where (■, ■) is the standard inner product on C ra . This satisfies 

( 4 -!8) Hfl^H < IMIII^H, max\g i:j \ < ||^|| for g = (g i:j ), 

the latter because g^j = {ge^ ef) in terms of the canonical basis. 

Proposition 4.5 (Large girth for finite Cayley graphs). Let S C SL 2 (Z) be a symmetric 
set, and let V = G(G, S) be the corresponding Cayley graphs. Let r > be defined by 

(4.19) r _1 = logmax ||s|| > 0, 

which depends only on S. 

(1) For all primes p and all r < rlog(p/2), where G p = SL 2 (F P ), the subgraph T r 
induced by the ball of radius r inY maps injectively to G(G P , S). 

(2) IfG is freely generated by S, in particular 1 ^ S, the Cayley graph Q(G P , S) contains 
no cycle of length < 2rlog(p/2), i.e., its girth girth(C(G p , S)) is at least 2rlog(p/2). 

Proof. The main point is that if all coordinates of two matrices ji, j 2 6 SL 2 (Z) are less 
than p/2 in absolute value, a congruence g\ = g% (modp) is equivalent to the equality 
gi = g 2 . And because G is freely generated by S, knowing a matrix in G is equivalent to 
knowing its expression as a word in the generators in S. 

Thus, let x be an element in the ball of radius r centered at the origin. By definition, 
x can be expressed as 

X S\ ' ' ' S m 

with m ^ r and Sj G S. Using (I4.18p . we get 

max \xi j\ \\si\\ ■ • ■ \\s m \\ ^ e m//r ^ e r ^ T . 

Applying the beginning remark and this fact to two elements x and y in the ball Si(r) 
of radius r centered at 1, for r such that e r l T < |, it follows that x = y (modp) implies 
x = y, which is (1). 

Then (2) follows because any embedding of a cycle 7 : C m —> C(G P , S) such that 
7(0) = 1 and such that 

d(l,7(i)) ^ m/2 < rlog(p/2) 

for all i can be lifted to the cycle (of the same length) with image in the Cayley graph of 
G with respect to S, and if S generates freely G, the latter graph is a tree. Thus a cycle 
of length m = girth(C(G p , S)) must satisfy m/2 ^ rlog(p/2). □ 

We can now check Condition (3) in the Bourgain-Gamburd criterion, first for cosets 
of the trivial subgroup, i.e., for the probability that X n be a fixed element when n is of 
size clogp for some fixed (but small) c > 0. As we did earlier, we clearly mark where we 
impose conditions on the size of p, and these will be made explicit in Section 14.31 

Corollary 4.6 (Decay of probabilities). Let S C SL 2 (Z) be a symmetric set, G the 
subgroup generated by S . Assume that S freely generates G. Let p be a prime such that 
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the reduction S p of S modulo p generates G p = SL 2 (F p ), and let (X n ) be the random walk 
on C(G P , S p ) with X = 1. Let 

r _1 = log max llsll > 0, 

as in Proposition ^.^ 

Fix a constant c with < c ^ 1. If p is large enough, depending on c and S , then for 

n — c\r log(p/2)J 

and any x G SL 2 (F p ) ; we have 

(4.20) P(X n = x)< t \G p \-°» 

where 

o 

More precisely, this holds for all 

2 \ 



(4.22) p ^ max (l7,2 exp(— 

The "extra" parameter c will be useful in the argument involving all proper subgroups 
H below. 

Proof. There exists xeG such that x reduces to x modulo p and x is at the same distance 
to 1 and by Proposition 14. 5[ (2), we have 

P(X n = x) = P(X n = x), 

for n ^ rlog(p/2), where (X n ) is the random walk starting at 1 on the IS^-regular tree 
G(G, S). By a well-known result of Kesten [16], we have 

P(X n = x) < r~ n with r = — j£! 

for all n ^ 1 and all i£G. Since c ^ 1 we have 

n = c[r log(p/2)J ^ cr log(p/2) — 1, 

and we obtain 

P(X n = a;) <r(£) < ' M ~ 



-2 

for p is 2r 2// ( crlogr \ Using the inequality 



P 

for p ^ 17, this becomes 



> IG 1 1/4 
2 ^ 1 pl 



p(x n = x) ^ |Gy- cr{logr)/8 

for all p ^ max(17, 2r 2 ^ crlogr ^). Since r ^ ^vi^l' we the desired result. □ 

In order to deal with cosets of other proper subgroups of SL 2 (F P ), we will exploit the 
fact that those subgroups are very well understood, and in particular, there is no proper 
subgroup that is "both big and complicated" . Precisely, by results going back to Dickson 
(see, e.g., the account in [23J Ch. 6] for PSL 2 (F p ), from which the result for SL 2 (F P ) 
follows easily), one knows that for p ^ 5, if H C SL 2 (F P ) is a proper subgroup, one of 
the following two properties holds: 

(1) The order of H is at most 120; 
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(2) For all (xi, x 2 , x 3 , x±) G H, we have 

(4.23) [[a;i,x 2 ], [x 3 ,x 4 ]] = 1. 

The first ones are "small" , and will be easy to handle using f l4.20p . The second are, from 
the group-theoretic point of view, not very complicated (their commutator subgroups are 
abelian). The following ad-hoc lemma^ takes care of them: 

Proposition 4.7. Let k ^ 2 be an integer and let W C be a subset of the free group 
on k generators (ai, . . . , a&) such that 

(4.24) [[x 1 ,x 2 ],[x 3 ,x i \] = 1 
for all (xi, x 2 , %3, £4) £ W . Then for any m ^ 1, we have 

\{x G W I d T (l,x) ^ m}\ < (4m + l)(8m + 1) ^ 45m 2 , 
where T is the \S\-regular tree G(F k , S), S = {af 1 }. 

Proof. The basic fact we need is that the condition [x,y] = 1 is very restrictive in F^: 
precisely, for a fixed 1 ^ 1, we have [x,y] = 1 if and only if y G Cp k (x), which is an 
infinite cyclic group. Denoting a generator by z, we find 

(4.25) \{y G Bi(m) | [x,y] = 1}\ = \{h G Z | d Tk (l,z h ) < m}| ^ 2m + 1 

since (a standard fact in free groups) we have dT(l,z h ) ^ \h\. 

Let W be a set satisfying the assumption f)4.24p . which we assume to be not reduced 
to {1}. We denote W m = W n r B 1 (m). First, if = 1 for all x, y G W m , then by 
taking a fixed x 7^ 1 in W 7 ™, we get W m C CF fc (x) fl Si(m), and (I4.25P gives the result. 

Otherwise, fix xo and yo in W m such that a = [xo,yo] 7^ 1. Then, for all y in W m we 
have [a, [xo,y]] = 1. Noting that dr(l, [xo,y]) ^ 4m, it follows again from the above that 
the number of possible values of [xo, y] is at most 8m + 1 for y G W m . 

Now for one such value b = [xq, y), we consider how many y\ G W m may satisfy [xo, y\] = 
b. We have [xq, y] = [xq, yi\ if and only if Lp(y~ l y\) = y~ x y\, where (p(y) = xoyx^ 1 denotes 
the inner automorphism of conjugation by xq. Hence y\ satisfies [xo,yi] = b if and only 
if ip{y~ l yi) = y~ 1 yi, which is equivalent to y~ l yi G CV fc (xo). Since y~ l y\ is an element 
at distance ^ 2m of 1 if y and y\ are in Si(m), applying (14.251) gives 

Ifce^iH I [x ,2/i] = [x ,y]}\ <4m+l, 

and hence we have \W m \ ^ (4m + l)(8m + 1) in that case, which proves the result. □ 

Using Corollary I4.6[ we finally verify fully Condition (3) in Corollary 14.41 

Corollary 4.8 (Decay of probabilities, II). Let S C SL 2 (Z) be a symmetric set, G the 
subgroup generated by S. Assume that S freely generates G. Let p be a prime such that 
the reduction S p of S modulo p generates G p = SL 2 (F P ), and let (X n ) be the random walk 
on Q(G P , Sp) with X = 1. Let 



r 1 = log max lis II > 0, 

s&S 



as in Proposition \4-5\ 

If p is large enough, then for 



n 



^log(p/2) 



6 Note that this is the only place where using prime fields F p instead of arbitrary finite fields really 
simplifies the argument, since (I4.23[) does not hold for proper subgroups of, say, SL2(F p 2). 
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any x G SL 2 (F p ) and any proper subgroup H C SL 2 (F p ), we have 

(4.26) P(X n G xH) ^ IGpT 7 

where 

(4,7) 7=! (M|^)). 

Proof. We start by noting that 

P(X n G a://) 2 < P(X 2n G if) 

for all x & G p and all subgroups H C G p . 

Consider first the case where (I4.23P holds for H. Let H C G be the pre-image of if 
under reduction modulo p. If 2n si rlog(p/2), then as in the proof of Corollary I4.6[ we 

get 

P(X 2n EH) = P(X 2n G H). 
Provided n also satisfies the stronger condition n ^ m = ^rlog(p/2), any commutator 

[[xi,x 2 ], [x 3 ,x 4 ]} 

with Xi G H n ®i(n) is an element at distance at most rlog(p/2) from 1 in the tree 
G(G, S), which reduces to the identity modulo p by (I4.23p . and therefore must be itself 
equal to 1. In other words, we can apply Proposition 14.71 to W = H fl £i(m) to deduce 
the upper bound 

\HH'B 1 {m)\ < 45m 2 . 

We now take 

n = ^L rl °g(p/ 2 )J> 

and we derive 

P(X 2n G H) < |ifn!Bi(m)|(maxP(X 2n = x)) ^ 45m 2 |G p |" 7l/16 

x£G p 

(where 71 is given by (I4.2ip , as in Corollary I4.6[) , and hence 



(4.28) P(X n G xH) ^ ^ T (\ogp/2)\G p \-^ 32 ^ \G P \-^ M 

provided p is large enough, which is the conclusion in that case. 

On the other hand, if (I4.23P does not hold, we have \H\ ^ 120, and for the same value 
of n we get 

(4.29) P{X n G xH) < 120|G p r 7l/32 ^ |G p r 7l/64 



for p large enough, by Corollary 14 . 6 1 with c = 1/32. This gives again the desired result. □ 

The following upper-bound on 7 was suggested by the referee: 
Lemma 4.9. With notation as in Corollary \4-8[ we have 

7 ^ 2" 5 . 

Proof. For n ^ 1, the cardinality of the ball r Bi(n) is at least (\S\ — l) n , and is at most 
\{geM 2 (Z) I (maxlHir, for 1 < i,j < 2} 

by (I4.18p . Thus, denoting A = max sg 5 ||s||, we find 

log(|Sl-l)<41og(2A + l), 
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and hence 



7 _ 2 -9 1 °g^vl' 5 'i) ^ 2 _ 7 log(2A + 1) 



log(A) " log(A) 

Now we note that either A ^ 2, or S is contained in the finite set of matrices in SL^Z) 
where all coefficients are in {—1,0, 1}. There are 20 such matrices, and all those which 
are not of finite order are parabolic. For these, we have ||s|| ^ \/2, and therefore A ^ \[2 
in all cases, and hence 

2 _ 7 log(2V2 + 1) 2 _ 5 _ 
log(v^) 

□ 

4.3. Summary. We can now summarize how to obtain an explicit spectral gap, for large 
enough p, in the situation of Theorem II. H finishing the proof. We then explain how to 
quantify the condition on p. 

We first consider the case where S C SL^Z) freely generates a free group of rank ^ 2 
(in which case it is automatically Zariski-dense in SL 2 ). 

Step 1 (when p is large enough). We have 

p — 1 



d(G 



pj ~ 2 

d 



for p ^ 3. In particular, d(G p ) \G P \ for any d < 1/3 provided p is large enough in 
terms of d. Moreover, by Theorem II .2| those groups are 5- flourishing with S = 1/3024. 

For the random walk (X n ) on G p associated to the generating set S p , with X = 1, we 
have 

P(X fe e xH) ^ |GT 7 

when 

k 

with 



r 



L32 



r 



-log(p/2) <-log(|G p |) 



r 1 = log max 7 



96 

-log(^v^) 



ses 11 ' 2 9 

by (I4.19P and f)4.27p . Thus in Corollary I4.4| we can take c = 1/96. The number of times 
we apply the basic L 2 -flattening inequality is bounded by 

/2c 2 + l 4c 2 \ /1875-3024 \ , 

j ^8max(^ 2 ^ ^8max(^ , 15000J = 48O6OOOO7- 1 

(using Lemma 14. 9[) and the spectral gap satisfies 

A,(D J, 1 - exp(-^) = 1 - exp(-^) > 

for all p large enough. For p ^ 17, we take d = 1/4, and this gives 

A 1 (r)^^-^^-^ 2- 226 ^ 1 . 
2^+ 4 c 2J+ 1 

Except that we incorporated the factor 2 9 from the current value of 7 to the constant 
factor (for esthetic reasons), this gives (11.11) . 

Step 2 (how large is "large enough"). We gather here, as a series of inequalities to be 
satisfied by p, the conditions under which we can apply the previous lower bound. These 
we gather from the proofs of the results of this section. First come inequalities that 
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make explicit the condition that |G| be large enough in Theorem 14.3] which are easily 
translated into conditions on p since | SL 2 (F P )| = p{p 2 — 1). 

• In order that (14.111) contradict (14.21) . we must have 

| G |7-*o(i+c2) > 4ci _ 

• In order that (14.1 4p contradict the growth alternative of Helfgott's Theorem, it is 
enough that 

r -1*1 (5-C25o7r 1 )~ 1 

|GT >4c 1 {c 1 (4c 1 )^ } 

whera^ 7i = 7 — (1 + c 2 )<5 (in view of (I4.13P ). 

• In order that (I4.16P give (14. 3p when 8\ satisfies (14.41) . it is enough that 

\G\ £ - 2c2So > (log3|G|) 4 , 

and that 

|G| 5 ^cr 2 (log3|G|) 4 . 

• In order that (14.171) hold, we must have 

(4.30) min(|Gf /4 , \G\ 5l/2 ) ^ c 3 . 

Now we list the conditions needed to apply the Bourgain-Gamburd criterion in the 
situation of Theorem 11.1] when 5* freely generates a free group of rank 15*1/2 ^ 2. 

• We need 

p ^ max ^17, 2exp^ — 

by (1422]) . 

• In order that the last inequality in (I4.28P hold, as well as ( I4.29p . it is enough that 



SL 2 (F P )P ^max(l20, (log| 



Remark 4.10. Below in Section |4~51 is found a straightforward Pari/GP [20] that com- 
putes the lower-bound of Step 1 for the spectral gap, given the set of matrices S, and 
that can also be used to determine for which p the bound is known to be applicable. 

We finally explain how to reduce the full statement of Theorem 11.11 to the case where 
the given symmetric subset S C SL 2 (Z) generates a free group, which is the one treated 
by the Bourgain-Gamburd method. 

For a given S C SL 2 (Z) which generates a Zariski-dense subgroup G of SL 2 , the 
intersection GnT(2), where T(2) is the principal congruence subgroup modulo 2, is a free 
subgroup of finite index in G. From a free generating set, one can extract two generators 
Si, s 2 G G to obtain a free subgroup of rank 2 of G, say Gi (since G fl T(2) has finite 
index in G, it is still Zariski-dense, and hence has rank at least 2). This subgroup is still 
Zariski-dense. We can then compare the expansion for the Cayley graphs of SL 2 (F P ) with 
respect to 5* and to Si = {sf 1 , sf 1 }. 

For p large enough so that G p = SL 2 (F P ) is generated both by 5* modulo p and 5*i 
modulo p, we have 

d(x,y) ^ Cdi(x,y) 

where di(-, ■) is the distance in the Cayley graph Ti = Q(G P , Si (modp)), and d(-, ■) the 
distance in T 2 = Q(G P , S (mod p)) and C is the maximum of the word length of Si, s 2 



This is not the same 71 that occurs in the proof of the decay of probabilities. 
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with respect to S. Hence, by a standard lemma (see, e.g., [Tj)J Lemma 3.1.16], applied to 
Ti and T 2 with / the identity), the expansion constants satisfy 

h(G(G p , S (mod p))) = h{T 2 ) ^ w- l h{<Z{G p , S 1 (modp))) 

with 

W = 4J2\S\ J ~ 1 - 

3=1 



In particular, using Theorem II. II for G\, we obtain the expansion property for G, and 
we can bound the spectral gap explicitly once we know expressions for the generators s\, 
s 2 in terms of those in S. 

As the referee pointed out, Breuillard and Gelander [21 Th. 1.2] have proved a strong 
uniform version of the Tits alternative which implies that there exists an absolute constant 
N ^ 1 such that, for any Zariski-dense subgroup G C SL 2 (Z), and for any symmetric 
generating set S C G, the combinatorial ball of radius N in G(G, S) contains two elements 
which generate a free subgroup of rank 2 of G. If a concrete value of N was known (which 
does not seem to be the case yet), one could use the above argument to state a version 
of the second part of Theorem 11.11 without the assumption of freeness. 



4.4. Diameter bound. We can now also prove quickly Corollary 11.41 Let Si = SU {1}. 
By Proposition I4.5| if we let 

r = |r log - J , 

where r is defined by (I4.19p . the size of Si^ is at least the size of a ball of radius r in a 
jS^-regular tree, which is well-known to be at least s r , where s = \S\ — 1. 
For p ^ 17, this gives 

Sjr) ^ s _1 (|) r > s- 1 |SL 2 (F p )| T ( logr ')/ 4 , 

and if p ^ exp(2r _1 ), this becomes 

S X W > |SL 2 (F p )| 52 , 

where 

r(logg) 
5 2 = — ^— > 0. 

Now we apply repeatedly Helfgott's Theorem with H = S\^ r \ For j such that 

F> log(Q 
J ^log(l + 5)' 

the S-^-fold product of H must be equal to SL 2 (F p ), and hence we get 
diame(SL 2 (F p ),5) ^ 3 J r < ^(log | SL 2 (F P )|), 

and taking 



J 

this gives the bound 



log(^" 1 ) 
log(l + 6) 



diame(SL 2 (F p ),S) ^ 3 log(5 " )/log(1+5) (log | SL 2 (F P )|). 
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4.5. Script. Here is a Pari/GP [2U] script that performs the computations needed to 
obtain an explicit spectral for Theorem 11.1] given as input a set of matrices S which 
generate a free group (this condition is not checked). 

\\ Norm of a matrix 

matnorm(m)=sqrt ( sum (i=l,matsize(m) [1] , sum (j=l,matsize(m) [2] ,m[i,j]~2))) 

\\ Spectral radius of random walk on k-regular tree 
gapr(s)=local(k) ;k=length(s) ;k/2/sqrt(k-l) 

\\ Growth constant in Helfgott's Theorem 
gapdelta(s) =1/3024 

\\ Minimal dimension of irreducible , DK for p at least 17 
gapd(s) =1/4 

\\ Constant c_2 in explicit multiplicative combinatorics 
gapc2 (s) =937 

\\ Logarithm of c_l , base 2 
gaplogcl (s) =2424 

\\ Logarithm of c_3 , base 2 
gaplogc3(s) =2438 

\\ "tau" invariant 

gaptau(s) = l/ log(vecmax (vector (length(s) ,i ,matnorm(s [i] ) ) ) ) 
\\ Value of gamma for p large enough 

gap gamma (s) =gaptau (s) * log (2/sqrt (3) * sqrt ( length (s)))/2~9 

\\ Bound for minus the logarithm in base 2 of spectral gap 
\\ for p large enough 

gaploggap(s) =2~26/gapgamma(s) 

\\ Value of delta_0 

gapdeltaO(s)=min(gapdelta(s)* gap gamma (s)/ (2*gapc2(s)+l) ,gapd(s)/8/gapc2 
(s)) 

\\ Value of delta_l 

gapdeltal (s) =l/2*min(gapdelta(s) * gap gamma (s)/(2*gapc2(s)+l) ,gapd(s)/8/ 
gapc2 ( s ) ) 

\\ Value of gammal in lower-bound conditions 
gapgammal ( s ) = gap gamma (s) -gapdeltal (s) * (l+gapc2 (s) ) 

\\ First minimal value on log p, base 2 

gaplogminl (s) =(2+gaplogcl (s) ) /3/( gap gamma (s) -gapdeltal (s) * (l+gapc2 (s) ) ) 



\\ Second minimal value on log p, base 2 
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gaplogmin2(s) =l/3/gapgammal (s)*(2+gaplogcl(s)+l/(gapdelta(s)-gapc2(s)* 
gapdeltaO(s) /gapgammal (s))*(gaplogcl(s) +l/gapgammal (s)*(2+gaplogcl(s 
)))) 

\\ Is log(p)=lp larger than third minimal value on log p (base e)? 
gapislogmin3 (s , lp)=if ( lp >= 1/3/ (gapd ( s ) /4-2*gapc2 (s) *gapdeltaO (s) ) *4* ( 
log(log(3)+lp) ) ,1 ,0) 

\\ Is log(p) larger than fourth minimal value on log p (base e)? 
gapislogmin4(s,lp)=if(lp>=l/3/gapdelta0(s)*(4*log(log(3)+lp)-2*log(2)* 
gaplogcl (s) ) ,1,0) 

\\ Fifth minimal values on log p, base 2 

gaplogmin5(s)=gaplogc3(s)/min(gapd(s)/4,gapdeltal(s)/2) 

\\ Constant c used in sixth minimal value 
gape (s) =gaptau (s) /96 

\\ Sixth minimal value on log p, base 2 

gaplogmin6 ( s) =max (log (17)/log(2) , 1+ (2/gaptau (s) /gape (s) ) /log (2) ) 

\\ Is log(p) larger than seventh minimal value on log p, base e 
gapislogmin7(s ,lp)=if ( 3* lp * gap gamma (s)>=log(lp-log(2)) ,1,0) 

\\ Eighth minimal value on log p, base 2 
gaplogmin8(s)=log(120) / log (2) /3/gapgamma ( s ) 

\\ Minimum of log(p), base 2, for gapislogmin3 
gapfind3(s)= { 
local ( j =2 , i , k) ; 

while(!gapislogmin3(s,j) , j = 2* j ) ; 
k=j/2; 

i=ceil((j+k)/2) ; 
while ( i ! = j , 

if ( ! gapislogmin3 (s , i) , 

k=i ; i = ceil ( ( j+k) /2) , 

j=i;i = ceil((j+k)/2))) ; 
ceil(i/log(2) ) ; 

} 

\\ Minimum of log(p), base 2, for gapislogmin4 
gapfind4(s)= { 
local ( j =2 , i , k) ; 

while(!gapislogmin4(s,j) , j = 2* j ) ; 
k=j/2; 

i=ceil((j+k)/2) ; 
while ( i ! = j , 

if ( ! gapislogmin4 (s , i) , 

k=i ; i = ceil ( ( j+k) /2) , 

j=i;i = ceil((j+k)/2))) ; 
ceil(i/log(2) ) ; 

} 



35 



\\ Minimum of log(p), base 2, for gapislogmin7 
gapf ind7 ( s ) = { 
local ( j =2 , i , k) ; 

while ( ! gapislogmin7 (s,j) ,j=2*j) ; 
k=j/2; 

i=ceil((j+k)/2) ; 
while ( i ! = j , 

if ( ! gapislogmin7 (s , i ) , 

k = i;i = ceil((j+k)/2) , 

j=i;i = ceil((j+k)/2))) ; 
ceil(i/log(2) ) ; 



\\ Minimum value of log(p) , base 2 

gapmin (s) =ceil ( vecmax ( [gaplogminl (s) , gaplogmin2 ( s ) ,gapfind3(s) ,gapfind4 
(s) , gaplogmin5 ( s ) , gaplogmin6 ( s ) ,gapfind7(s) , gaplogmin8 ( s ) ] ) ) 

\\ Base 2 bound for gapmin(s) 
gapminlog(s)=ceil ( log ( gapmin (s) )/log(2) ) 

\\ Generators of the Lubotzky group 

ls = [[1 ,3;0 , 1] ,[1,-3:0,1] , [1 ,0;3 , 1] , [1 ,0; -3 , 1]] 

\\ ? gaploggap ( Is ) 

\\ gaploggap ( Is ) 

\\ 7,29 = 49218765900.678024122794300454114797957 

\\ ? log ( gaploggap ( Is ))/ log (2) 

\\ log ( gaploggap ( Is )) /log (2) 

\\ 7,30 = 35.518489433339899156620630706081878314 

\\ ? gapminlog ( Is ) 

\\ gapminlog ( Is ) 

\\ 7.31 = 46 



In this appendix, we sketch the proof of Theorem 12.11 following very (essentially) line 
by line Tao's paper [24J. The presentation is therefore highly condensed, though we use 
a "diagram" notation which should make it relatively easy to check how the values of the 
constants evolve. 

Below all sets are subsets of a fixed finite group G, and are all non-empty. 

5.1. Diagrams. We will use the following diagrammatic notation: 
(1) If A and B are sets with Ruzsa distance 



> 



5. Appendix: proof of Theorem 12.11 




such that d(A, B) ^ log a, we write 



A* 



a 



(2) If A and B are sets with \B\ ^ ot\A\, we write 



B* 



a 



A, 
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and in particular if \X\ ^ a, we write X • a > 1 , 

(3) If A and B are sets with e(A, B) ^ 1/a, we write 

(4) If A C -B, we also write A> ^ B . 

The following rules are easy to check (in addition to some more obvious ones which we 
do not spell out): 

(1) From 

A* »B 

a 

we can get 

a^Lb, B^A. 

(2) (Ruzsa's triangle inequality) From 



we get 

(3) From 
we get 

(4) ( "Unfolding edges" ) From 



A' >B. .C 



A» »C . 

OL\OL2 



CJ^B^A 



C^lA. 



B^+A 



we get 

AB~ X » ^ , A 
(note that by the first point in this list, we only need to have 

B» »A 

to obtain the full statement with a = (3 2 , which is usually qualitatively equivalent.) 
(5) ("Folding") From 

AB-'^^A^B 



we get 



A* *B . 

a/3 1 / 2 



Note that the relation A* - > B is purely a matter of the size of A and B, while the 

other arrow types depend on structural relations involving the sets (for A > >■ B ) and 

product sets (for A* - »B or A» a »B). 
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5.2. Proofs. First we state the Ruzsa covering lemma [2H Lemma 3.6] in our language: 
Theorem 5.1 (Ruzsa). If 

there exists a set X which satisfies 

X> ^B, X»-l+l, B> ►A" 1 AX", 

and symmetrically, if 

BA^+A, 

there exists Y with 

Y> -5 , F—^l, B> ^YAA- 1 . 

Next we have the link between sets with small tripling and approximate subgroups [2U 
Th. 3.9 and Cor. 3.10]: 

Theorem 5.2. Let A = A~ l with 1 £ A and 

AW^L+A. 

Then H = A^ is a (2a 5 ) -approximate subgroup containing A. 
Proof. We have first 

H—^+A, A> >H . 

Then by Ruzsa's lemma I3"TT| we get 

and by the Ruzsa covering lemma there exists X with 

X> *HW, X+^+l, 

such that 

H m > _ A (2) X , _ A ^X = EX . 

Taking X\ = X U X -1 , we get a symmetric set with 

X ± > -i/( 2 ), X x ^l, 

and 

> ^ EX , > ^ XH , 

which are the properties defining a (2a 5 )-approximate subgroup. □ 

The next result is the explicit form of [2U Th. 4.6, (i) implies (ii)]: 
Theorem 5.3. Let A and B with 

A' »B- 1 

a 

Then there exists a ^-approximate subgroup H and a set X with 

X+^+1, A> *XH, B> >HX, H^+A, 

where 

7 ^ 2 21 « 80 , 7l ^ 2 2 V 04 , 72 < 8« 14 . 
Furthermore, one can ensure that 

(5.1) H®^-^ >H. 

38 



Proof. From 



we get first 

AA- l ^A. 

By [21 Prop. 4.5], we find a set S with@ 1 6 5 and S = S' 1 such that 

A^S, ASM A- 1 * 2 "" 4 " +2 , A 
for all n ^ 1. In particular, we get 

AS-^AS* — — >■ A , 5. — — >■ A . 

We have 

g( 3 >. 8 " 14 : A. . S, 

and Theorem 15.21 says that H = S® is a 7- approximate subgroup containing S 1 , with 
7 = 2(16a 16 ) 5 = 2 21 a 80 , and (as we see) 

H. te " . A. 

Moreover, we have 

H (s) = s <9) > , AS^A- 1 ■ 29 " 38 . A^S, 

which gives (15. ip . 
Now from 

A# = ^ » 8q14 > A • 5 -3- if , 

we see by the Ruzsa covering lemma that there exists Y with 

Y> *A, Y> — 16 " 16 > 1 , A> *YHH . 

By definition of an approximate subgroup, there exists Z with 

and hence 

A> »- (yz)# . 

Now we go towards B. First we have 

AH- 1 = AS® . 8 " 14 > A ^ if 
which, again by folding, gives 

A* »H 

ai 

with oi\ = 8\/2a 15 . Hence we can write 

H» »A» 'B- 1 , 

and so 

H» 'B- 1 . 

ttOtl 



8 The property 1 G S is not explicitly stated in [23], but follows from the explicit definition used by 
Tao, namely S = {x e G | \Af] Ax\ > {2a 2 )- 1 \A\}. 
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In addition, we have 



and therefore we get 



from which it follows by unfolding that 

B-^H- 1 = B- l H. 32q2 ° > B- 1 ^ H . 

Once more by the Ruzsa covering lemma, we find Y\ with 

96^24 

Y x > ^B- 1 , Yi • — — B~ 1 > s- Y 1 HH> »- (Y 1 Z)H . 

Now we need only take X = {Y\Z U YZ), so that 

with 7! = 7(64a 24 + 16a 16 ), in order to conclude. Since 

7i ^ 2 2 V 04 , 

we are done. □ 

The next result is a version of the Balog-Gowers-Szemeredi Lemma found in [2U Th. 
5.2]. 

Theorem 5.4. Let A and B with 

A» a »B . 

Then there exist A\, B\ with 

A x > >- A , B x > ^B, 

as well as 

A. 8V ~ 2a . Ai , B. 52-^S x> 

and 

Ax* *B7 l 

where ct\ = 2 23 a 9 . 

This is not entirely spelled out in [24J, but only the last two or three inequalities in 
the proof need to be made explicit to obtain this value of a±. Finally, the next theorem 
is just the "diagrammatic" version of Theorem 12 .![ and therefore completes its proof. It 
is an explicit version of [2JJ Th. 5.4; (i) implies (iv)]. 

Theorem 5.5. Let A and B with 

,1. " *B . 

Then there exist a (3 -approximate subgroup H and x, y G G, such that 
hJ^A, AJ^+AnxH , B «-^*- B fl Hy , 

where 

(3 < 2 186 V 20 , ft ^ 2 242 V 37 , ft < 2 32 V 26 . 
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Moreover, one can ensure that 



where 3 = 2 930 a 360 . 

Proof. By the Balog-Gowers-Szemeredi Theorem, we get A\, B\ with 

A x > A , B x > ^B, 

as well as 

A* 8V ~ 2a : A, , B. *-^-B Xi 

and 

Ax- 'B7 1 

where ot\ = 2 23 a 9 . Applying Theorem 15. 3l to A x and B x , we get a /3-approximate subgroup 
H and a set X with 

H» Ai— A 

and 

X—^l, Ai> ^X#, Si> ^M, 

where 

a o21^80 o 1 861^720 „, 28 104 O 2420^936 

and moreover 

where & = 2 10 < = 2 930 « 360 . 

Applying the pigeonhole principle, we find x such that 



and y with 



A. — 8v ^° » Ax^-^AxHa:^ A n xH 



B» ^B 1 <^+B 1 nHy> ^BHHy 



This gives what we want with 

#l ^ Sv^ery ^ 2 242 V 37 , ft = 8aj 4 = 2 32 V 26 . 



□ 
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